METHOD OF PRODUCING A RECOMBINANT PROTEIN OF INTEREST, 

AND PROTEIN PRODUCED 

[0001] The invention relates to a novel method for producing large quantities of a protein of 

interest, which can be used directly for structural analyses. The invention also relates to the 

recombinant protein obtained. 

[0002] G protein-coupled receptors (GPCRs) constitute a superfamily of membrane proteins 
characterized by 7 transmembrane domains (TM I to VII) which play an essential role in 
intercellular communication and the reception of sensory signals [1]. 

[0003] With several hundred members identified, GPCRs form the largest structural and 
functional family of membrane receptors. They represent in particular a significant part of the 
human genome known to date (at least 700 receptors, 0.5% of the genome). 
[0004] Expressed at the surface of all the cells of an organism (from yeast to man), they are 
activated by a large variety of extracellular messages (peptides, hormones, lipids, odorous 
molecules, light, nucleotides, nucleosides, taste molecules, etc.). Activation thereof gives rise to 
an intracellular cascade of signals via G proteins and results in a large number of cellular 
responses (for example cell division or shrinkage, neurotransmission). 

[0005] In general, GPCRs are involved in each physiological function. The importance of 
these receptors and the fact that their location in the cell is known makes them ideal targets for 
therapy. And in fact it may be estimated that almost 50% of medicaments on the market act via 
the GPCRs. Many pathologies are the result of GPCR mutations, and their clinical 
manifestations are well known; mention may be made for example of blindness, nephrogenic 
diabetes insipidus, hypothyroidism or hyperthyroidism, precocious puberty, obesity [2]. 
[0006] The discovery that some chemokine receptors are cofactors of infection by the HIV 
virus reinforces the idea that GPCRs are involved in a wide range of pathological situations [3]. 



[0007] These general considerations clearly demonstrate the need to study the functional 
architecture of these receptors, so as to better understand the signal transduction process and the 
dynamics of their interactions with various molecules (ligands or intracellular partners), and to 
develop new pharmacological and therapeutic tools. However, the study of the functional 
architecture of GPCRs using "direct" experimental methods (X-ray crystallography, NMR, mass 
spectrometry) still remains very limited. Just one three-dimensional (3D) structure is currently 
known, that of bovine rhodopsin [4], on account of the very high natural level of expression of 
this receptor in the retina. Knowledge of their functional architecture is thus currently obtained 
using a set of methods involving theoretical methods (modeling), physicochemical methods 
(photolabeling, fluorescence) and biological methods (site-specific mutagenesis, molecular 
pharmacology, knock-out, etc.). 

[0008] These studies are of extreme importance on the industrial and socioeconomic levels, 
given the potential therapeutic applications. 

[0009] However, studying the structure and function of the GPCRs is very difficult for 
various reasons: 

the transmembrane nature of these proteins and their hydrophobicity makes them 
delicate to handle and usually leads to a loss of functionality and to denaturation 
following solubilization; 

it remains very difficult to obtain them in their complete primary sequence. Most 
of the time, they are expressed in truncated form [5]; 

they are expressed in very low quantity (0.01% of membrane proteins), which 
forms an obstacle to purifying them in large quantities; 
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their molecular weight is high (greater than 40 kD), and they are characterized by 
the presence of post-translational modifications (glycosylation, palmitoylation, 
phosphorylation) and particular structural features (disulfide bridges); 

they are multifunctional proteins having domains with different roles: ligand 
binding, G protein activation, allosteric sites, zones involved in their 
regulation/desensitization. 
[0010] It will be easily understood that the critical step which at present forms a real obstacle 
is surely that of obtaining GPCRs in amounts compatible with "direct" structural biology 
approaches, 

[0011] To date, no strategy has been developed for producing them in large quantity and in a 
way which can be generalized to all GPCRs, and which furthermore allows simple purification 
thereof in a functional form. At times, some receptors have been produced in high quantities 
(mg/1 of culture) [6-8], but the methods used cannot be applied to most GPCRs. 
[0012] Within the context of producing a protein in large quantities, GPCRs represent only 
one example of the difficulties encountered when trying to obtain a large quantity of a protein of 
interest. The object of the present invention is to provide a method for producing a large 
quantity of a protein of interest, particularly GPCRs. 

[0013] The inventors have surprisingly demonstrated that the construct of recombinant 
proteins, particularly membrane proteins, and most particularly GPCRs, comprising at least one 
fragment of an alpha-integrin and the protein of interest, makes it possible to obtain recombinant 
proteins capable of being expressed in large quantities. This strategy makes it possible in 
particular to obtain a production of said proteins in a large quantity in microorganisms, 
particularly in bacteria. When the recombinant proteins of the invention are produced in 
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bacteria, they accumulate in the inclusion body of the bacterial cytoplasm. It is then necessary to 
renaturate the proteins of interest so as to obtain them in active form in a quantity compatible 
with direct analysis of their structure, for example by X-ray crystallography or nuclear magnetic 
resonance (NMR). The method of the invention can furthermore permit the production of non- 
truncated proteins, particularly when it is applied to GPCRs. 

[0014] Integrins form a family of receptors which are associated in terms of their structure 
and function and which participate in cell-cell and cell-extracellular matrix interactions. All the 
integrins are in the form of heterodimers of alpha and beta subunits, which are bonded 
noncovalently. Based on their primary sequence, all the alpha-integrins have an N-terminal 
region formed of seven repeated amino acid sequences (repeats I to VII), each comprising 
approximately 60 amino acids. Some alpha subunits include an insertion domain (I domain) of 
around 200 amino acids, located between repeats II and III. The homologies between repeats I 
and VII essentially comprise consensus sequences FG and GAP, corresponding to the 
phenylalanine, glycyl-glycyl, alanyl and prolyl chains, hence their name "FG-GAP repeat". 
[0015] As far as the inventors are aware, the alpha subunit of integrins (also referred to as 
alpha-integrin (oc-integrin) in the text) has never been used to produce recombinant proteins of 
interest in cells other than mammalian cells, and to do so in a quantity that is directly compatible 
with structural analysis of the protein of interest, this requiring a quantity of said protein which 
may be as much as several milligrams. 

[0016] The present invention aims to meet this requirement. 

[0017] Thus, the invention firstly relates to the use of at least one fragment of an alpha- 
integrin in the construct of at least one recombinant protein of interest. The invention also 
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relates to the use of at least one fragment of an alpha-integrin for producing at least one 
recombinant protein of interest. 

[0018] In the present text, the expression "recombinant protein" or "recombinant protein of 
interest" relates to the recombinant protein produced according to the invention. This recom- 
binant protein may in particular comprise the chaining of several (at least two) proteins of 
interest which are fused, and which may optionally be separated by spacer sequences and/or 
cleavage sequences. 

[0019] The expression "protein of interest" relates to the peptide sequence corresponding to a 
protein of interest which it is desired to produce (or which has been produced). 
[0020] Thus, it will be understood that a "recombinant protein" is formed of one or more 
"proteins of interest", optionally separated by spacer sequences and/or cleavage sequences. 
[0021] Fragment of an alpha-integrin will be understood to mean both the complete amino 
acid sequence of the alpha-integrin used and also a partial sequence. The sequence of the alpha- 
integrin which is used may be native or mutated. Preferably, according to the invention, the 
sequence used is a sequence comprising the N-terminal end of the alpha-integrin used, even 
more preferably a sequence corresponding to the N-terminal end of the alpha-integrin used. 
[0022] According to one particular embodiment of the invention, the fragment of the alpha- 
integrin used comprises at least FG-GAP modules IV to VII and a portion of FG-GAP module III 
of the alpha-integrin used. 

[0023] The fragment of the alpha-integrin used may be a fragment of any known alpha- 
integrin. Mention will be made in particular of the integrins al, a2, a3, a4, a5, a6, a7, a8, a9, 
a 10, all, aD, aE, aL, aM, aX, allb or aV. 
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[0024] According to one particular embodiment of the invention, use is made of a fragment 
of 287 amino acids, corresponding to the part of the N-terminal end of alpha-5-integrin which 
extends between positions 231 and 517, according to the numbering which takes account of the 
presence of the signal peptide. If account is not taken of the signal peptide, the fragment which 
can be used in the invention extends from position 190 (G residue) to 476 (G residue) of alpha-5- 
integrin. 

[0025] When use is made of other alpha-integrins, the fragments which can be used 
according to the invention are the fragments homologous to the fragments defined above. For 
example, in the case of otV-integrin, the fragment which can be used according to the invention 
corresponds to the part of the N-terminal end of otV-integrin which extends from position 21 1 (G 
residue) to 495 (G residue) according to the numbering which takes account of the presence of 
the signal peptide. If account is not taken of the signal peptide, the fragment which can be used 
in the invention extends from position 181 (G residue) to 465 (G residue) of ocV-integrin. In the 
case of ocllb-integrin, the fragment which can be used according to the invention corresponds to 
the part of the N-terminal end of oJIb-integrin which extends from position 224 (G residue) to 
508 (Q residue) according to the numbering which takes account of the presence of the signal 
peptide. If account is not taken of the signal peptide, the fragment which can be used in the 
invention extends from position 193 (G residue) to 477 (Q residue) of ocllb-integrin. 
[0026] According to one particular embodiment of the invention, the fragment of the alpha- 
integrin used comprises at least one amino acid sequence selected from the sequences SEQ ID 
No. 1 (fragment of human cc5-integrin), SEQ ID No. 2 (fragment of human V-integrin) and SEQ 
ID No. 3 (fragment of human allb-integrin) in the appended sequence listing. 
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[0027] According to another particular embodiment of the invention, the alpha-integrin 
fragment used comprises at least one amino acid sequence encoded by one of the nucleotide 
sequences selected from the sequences SEQ ID No. 4 (fragment of human ot5-integrin), SEQ ID 
No. 5 (fragment of human V-integrin) and SEQ ID No. 6 (fragment of human otllb-integrin) in 
the appended sequence listing. 

[0028] According to yet another particular embodiment of the invention, it is possible that 
the fragment of alpha-integrin is used in the construct of several (at least two) recombinant 
proteins of interest. In this case, the recombinant proteins will be fused during translation. This 
may prove necessary in the case of a protein of interest in respect of which the construct 
according to the invention does not allow its direct production (refractory protein). It is then 
necessary to couple in tandem the sequence of said refractory protein to a recombinant protein of 
interest which the constructs according to the invention make it possible to produce. Thus, 
according to this particular embodiment of the invention, the construct according to the invention 
will comprise at least one DNA fragment encoding at least one fragment of an alpha-integrin, 
then at least one DNA encoding at least a first recombinant protein of interest and at least one 
DNA encoding at least a second recombinant protein of interest. According to this particular 
embodiment of the invention, the DNA encoding the second protein of interest will be inserted in 
the construct in phase downstream of the DNA sequence encoding the first protein of interest. 
This particular embodiment can be combined with any one of the particular embodiments 
described above. 

[0029] Preferably, according to the invention, the alpha-integrin fragment is located in the 
recombinant protein of interest prepared according to the invention, upstream of the sequence of 
the protein of interest (or proteins of interest) to be produced, that is to say at the N-terminal end 
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of the recombinant protein of interest (or recombinant proteins of interest) which are to be 
constructed and/or produced. 

[0030] The invention also relates to a recombinant protein, characterized in that it comprises, 
fused together, at least one fragment of an alpha-integrin as defined above and at least one 
protein of interest. 

[0031] The protein(s) of interest, which forms (form) part of the recombinant protein of the 
invention, may be any protein which it is desired to produce, particularly a membrane protein, 
more particularly a G protein-coupled receptor (GPCRs). By way of example of the latter, 
mention may be made of vasopressin and oxytocin receptors (Via, V2, OTR), leukotriene 
receptors (BLT1, BLT2, CysLTl, CysLT2), adrenergic receptors (beta-3), cannabinoid receptors 
(CB1), chemokine receptors (CCR5, CXCR4), the angiotensin II ATI receptor, the bradykinin 
B2 receptor. 

[0032] The recombinant protein of the invention (regardless of the embodiment of the 
invention) may furthermore comprise any amino acid sequence which makes it possible to purify 
said protein in a simple manner. Thus, according to one particular embodiment of the invention, 
the recombinant protein may comprise a sequence of 6 histidine residues (6xHIS tag). This 
6xHIS tag may be incorporated in the sequence of the protein with a view to its purification on a 
Ni-NTA (nickel-nitrilotriacetic acid) agarose column. Preferably, this sequence is at the C- 
terminal end of the recombinant protein of the invention. When the recombinant protein 
according to the invention consists of at least two fused proteins, the 6xHIS tag is preferably 
located downstream of the last of the proteins of interest which it is desired to produce. 
[0033] The sequence encoding the recombinant protein may furthermore comprise at least 
one sequence encoding at least one endoprotease cleavage site. 
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[0034] Advantageously, the sequence coding for the last residues of the integrin may be 
mutated to form an endoprotease cleavage site (factor Xa, thrombin), which, following 
expression and purification of the recombinant protein, will make it possible to separate the 
protein of interest from its fusion partner. According to one particular embodiment of the 
invention, the L residue (position 285) may be modified by mutation into an I residue, the E and 
G residues (positions 286 and 287) being preserved. An additional R residue may be introduced 
by mutagenesis. The chain thus formed (IEGR) corresponds to the factor Xa cleavage site which 
cuts the protein after the R residue. 

[0035] In another embodiment, the factor Xa cleavage site can be transformed into a 
thrombin cleavage site. To do this, the I, E and G residues can be replaced by L, V and P 
residues. The R residue is preserved so as to obtain the chain LVPR. Since the integrin 
fragment has been incorporated into the vector at the 3' end by a BamHI site (sequence ggatcc), 
there is thus obtained the sequence ggatcc coding for two residues G and S just after LVPR. The 
LVPRGS chain forms the thrombin cleavage site, which cuts the protein after the R residue. 
[0036] It will thus be understood that, in a more elaborate embodiment, the recombinant 
protein of the invention comprises, from its N-terminal end to its C-terminal end, the alpha- 
integrin fragment comprising the endoprotease cleavage site, the protein(s) of interest and the 
6xHIS tag. 

[0037] In one particular embodiment, the recombinant protein of the invention comprises, 
from its N-terminal end to its C-terminal end, the alpha-integrin fragment comprising the factor 
Xa cleavage site, the protein(s) of interest and the 6xHIS tag. 
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[0038] In another particular embodiment, the recombinant protein of the invention 
comprises, from its N-terminal end to its C-terminal end, the alpha-integrin fragment comprising 
the thrombin cleavage site, the protein(s) of interest and the 6xHIS tag. 

[0039] It may still be necessary, when the recombinant protein according to the invention 
comprises more than one protein of interest which are fused together, that said proteins of 
interest can be separated after synthesis, for example before purification. Thus, it is possible to 
insert, between the different DNA sequences encoding the different proteins of interest, at least 
one DNA sequence encoding an endoprotease cleavage site. It is possible for cleavage sites for 
different endonucleases to be inserted into the same recombinant protein. 

[0040] It may be necessary to make the cleavage of the recombinant protein even more 
effective. In this respect, it is possible to insert into the construct according to the invention a 
sequence encoding a peptide sequence which serves as a spacer arm, preferably located upstream 
of the endoprotease cleavage site. 

[0041] Thus, according to one particular embodiment of the invention, the recombinant 
protein furthermore comprises a peptide sequence serving as a spacer arm, preferably located 
upstream of the endoprotease cleavage site. 

[0042] Therefore, in an even more elaborate embodiment, the recombinant protein of the 
invention comprises, from its N-terminal end to its C-terminal end, the alpha-integrin fragment 
comprising a spacer arm and the endoprotease cleavage site, the protein(s) of interest and the 
6xHIS tag. 

[0043] In one particular embodiment, the recombinant protein of the invention comprises, 
from its N-terminal end to its C-terminal end, the alpha-integrin fragment comprising a spacer 
arm, the factor Xa cleavage site, the protein(s) of interest and the 6xHIS tag. 
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[0044] In another particular embodiment, the recombinant protein of the invention 
comprises, from its N-terminal end to its C-terminal end, the alpha-integrin fragment comprising 
a spacer arm, the thrombin cleavage site, the protein(s) of interest and the 6xHIS tag. 
[0045] According to the invention, the sequence encoding a peptide sequence which serves 
as a spacer arm may be any sequence known to the person skilled in the art which allows a 
sufficient spacing between the endoprotease cleavage site and the protein(s) of interest for the 
cleavage of the recombinant protein to be effective. 

[0046] Preferably, said sequence encoding a peptide sequence serving as a spacer arm is the 
following sequence SEQ ID No. 7: 5' GACCCGGGTGGTGGTGGTGGTGGTGGTGGTGGT 
3' encoding the following peptide sequence SEQ ID No. 8: DPGGGGGGGG. 
[0047] Thus, in a most elaborate embodiment, the recombinant protein of the invention 
comprises, from its N-terminal end to its C-terminal end, the alpha-integrin fragment comprising 
a spacer arm and the endoprotease cleavage site, the protein(s) of interest, separated by one or 
more endoprotease cleavage sites (for example factor Xa or thrombin cleavage sites) and the 
6xHIS tag. 

[0048] The invention also relates to the use of at least one fragment of a nucleotide sequence 
coding for at least one fragment of an alpha-integrin as defined above, in the construct of a 
nucleotide sequence coding for a recombinant protein of interest as defined above. 
[0049] The invention also relates to the use of at least one fragment of a nucleotide sequence 
coding for at least one fragment of an alpha-integrin as defined above, for producing a 
recombinant protein of interest as defined above. 

[0050] The invention also relates to a nucleotide sequence coding for a recombinant protein 
of interest comprising at least one fragment of a nucleotide sequence coding for at least one 
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fragment of an alpha-integrin, as defined above, and a nucleotide sequence coding for at least 
one protein of interest, as defined above. 

[0051] Preferably, the nucleotide sequence coding for at least one fragment of an alpha- 
integrin which can be used according to the invention or is included in the nucleotide sequence 
coding for a recombinant protein of interest according to the invention may be selected from the 
nucleotide sequences SEQ ID No. 4, SEQ ED No. 5 and SEQ ID No. 6 in the appended sequence 
listing. 

[0052] The invention also relates to a vector comprising a nucleotide sequence coding for a 
recombinant protein of interest, as defined above, comprising at least one fragment of a 
nucleotide sequence coding for at least one fragment of an alpha-integrin and a nucleotide 
sequence coding for at least one protein of interest. The vector may be a eukaryotic vector such 
as a plasmid or a virus. The vector may also be any prokaryotic vector such as a plasmid or a 
phage. 

[0053] Preferably, the vector is an expression vector, that is to say a vector capable of 
allowing the transcription and translation of the nucleotide sequence it contains. 
[0054] By way of example, mention may be made of the vectors of the pET family which are 
sold by the company Novagen or those of the pGEX family which are sold by the company 
Amersham Biosciences. 

[0055] The invention also relates to a cell, into which a nucleotide sequence coding for a 
recombinant protein of interest, as defined above, has been introduced. According to one 
particular embodiment of the invention, the sequence has been introduced in the form of a vector 
as defined above. 
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[0056] Cell in this case will be understood to mean both a eukaryotic cell and a prokaryotic 
cell, particularly a bacterium. Any bacterium capable of allowing the expression of a protein 
from a nucleotide sequence may be used according to the invention. By way of example, 
mention may be made of all bacteria which derive from BL21, BL21 star, Rosetta, BLR, 
Origami, Tuner, Novablue, all commercially available. 

[0057] The invention also relates to a method for producing at least one protein of interest, 
characterized in that, in a first step, there is introduced into a cell a nucleotide sequence coding 
for a recombinant protein of interest, as defined above, and in that, in a second step, the cell is 
placed under conditions sufficient for allowing the expression of the recombinant protein of 
interest. 

[0058] The method of the invention may furthermore comprise an additional step during 
which the recombinant protein of interest may be cut by the action of an endoprotease (factor Xa, 
thrombin, for example), at the site created in the last residues of the integrin so as to separate the 
protein of interest from its fusion partner. 

[0059] The method of the invention may also comprise an additional step during which the 
recombinant protein of interest, or the protein(s) of interest separated from its (their) fusion 
partner(s), may be purified. 

[0060] According to the method of the invention, the nucleotide sequence coding for a 
recombinant protein of interest may be introduced into the cell by any known method. By way 
of example of methods which can be used, it is possible to mention, in respect of prokaryotic 
cells, heat shock or electroporation. In respect of eukaryotic cells, mention may be made of 
electroporation, the calcium phosphate precipitate method, the use of cationic polymers such as 
DEAE-dextran, or any method using cationic liposomes or activated dendrimers. It is also possi- 
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ble to use retroviruses to carry out gene transfer, and also techniques using microprojectiles to 
deliver DNA to target cells. 

[0061] Likewise, any sufficient condition known to the person skilled in the art which allows 
the expression of the recombinant protein of interest can be used according to the method of the 
invention. 

[0062] Finally, any method of purifying the protein(s) which is known to the person skilled 
in the art can be used according to the method of the invention. By way of example, mention 
may be made of the methods of affinity chromatography, ion exchange chromatography, 
hydrophobic interaction chromatography or filtration using a molecular sieve. 
[0063] In particular, when the recombinant protein of interest comprises the 6xHIS tag, 
purification on a nickel-nitrilotriacetic acid (Ni-NTA) agarose column represents one method of 
purification which is particularly satisfactory within the context of the method of the invention. 
[0064] The techniques which can be used according to the invention are known to the person 
skilled in the art. The latter can refer to the numerous manuals which are available, and in 
particular to "Molecular Cloning, a laboratory manual. 2nd edition, Sambrook, Fritsch, Maniatis 
eds., CSH laboratory press, (1989)". 

[0065] Besides the above provisions, the invention also comprises other provisions which 
will emerge from the following description which refers to examples of embodiments of the 
invention and also to the appended drawings, in which: 

Fig. 1 shows a construct corresponding to a vector according to the invention. 

Fig. 2 shows the production of the <x5-integrin/vasopressin V2 receptor fusion protein 
according to the method of the invention (left-hand column: molecular weight of the proteins of 
the marker sample; arrow: position of the o5-integrin/vasopressin V2 receptor recombinant 
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protein, NI: proteins of a non-induced sample, 2h, 3h and 4h: proteins of an induced sample 
after 2h, 3h and 4h of induction). 

Fig. 3 shows the a5-integrin/vasopressin V2 receptor recombinant protein of Fig. 2 after 
purification and migration on electrophoresis gel (left-hand column: molecular weight of the 
proteins of the marker sample; arrow: position of the a5-integrin/vasopressin V2 receptor 
recombinant protein). 

Fig. 4 shows the result of purification of the a5-integrin/vasopressin V2 receptor CXCR4 
(a5-V2-CXCR4) recombinant fusion protein by means of affinity chromatography. 

S6M: supernatant of solubilization in 6M urea buffer, deposited on Ni-NTA agarose 

resin 

FT: sample not held on the resin 

W: fraction wash containing 15 mM imidazole 

E100: purified fusion eluted in a buffer containing 100 mM imidazole 

[0066] The arrow indicates the position of the oc5-V2-CXCR4 fusion protein. 

[0067] The following examples illustrate the invention and do not limit it in any way. 

Example 1 : Construction of a vector which allows the expression in bacteria of a recombinant 
protein of interest: 

[0068] A complementary DNA coding for the protein of interest which it is desired to 
express is positioned in the vector pET21a (+) (sold by the company Novagen) in phase with a 
fragment of complementary DNA of a5-integrin, by using appropriate restriction sites. The a5- 
integrin fragment is delimited by the Ndel and BamHI sites. The Ndel site has the advantage of 
incorporating an ATG codon which is the translation initiator codon. This initiator codon codes 
for a methionine (M). The Ndel site formed by the sequence CATATG is thus of interest for 
subcloning a DNA fragment since the target sequence is located directly in phase with the ATG 
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codon. The latter then forms residue 1. With regard to alpha5-integrin 5 the ATG of the Ndel site 
is positioned upstream of its nucleotide sequence. In this case, the ATG will code for an Ml and 
the G of the integrin will be residue 2. The fragment of 287 residues will be coupled to 
methionine 1 and a fusion partner of 288 residues is thus obtained: M1-G288. 
[0069] The vector directly provides the sequence coding for the 6xHIS tag which will be 
located at the C-terminal end of the recombinant protein of interest. An EcoRI site is located in 
the vector at the N-terminal end of the tag site. Thus, the complementary DNA coding for the 
protein of interest is inserted between the BamHI site marking the C-terminal end of the 
complementary DNA fragment of the a5 -integrin and the EcoRI site located at the N-terminal 
end of the 6xHIS tag. 

[0070] Fig. 1 shows the diagram of such a construct. 
Example 2: Expression of the human vasopressin V2 receptor: 
Construction of the vector: 

[0071] The complementary DNA of the human vasopressin V2 receptor (Cotte et al., J. 
BIOL. Chem. 273, 29462-29468, 1998) is inserted between the BamHI and EcoRI sites of the 
vector obtained in Example 1 . 

Step 1 : Preparation of the complementary DNA of the human vasopressin V2 receptor: 
[0072] Recognition sites for the restriction enzymes BamHI and EcoRI are added on either 
side of the complementary DNA sequence of the human vasopressin V2 receptor. This is done 
using the conventional PCR technique. The complementary DNA of the human vasopressin V2 
receptor is amplified from the vector pRK5-V2 (Cotte et al., J. BIOL. Chem. 273, 29462-29468, 
1998) with the aid of two primer oligonucleotides which make it possible to insert the desired 
restriction sites: 
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sense oligo (allows the incorporation of the BamHI site): 5' ATG GGT CGC 
GGA TCC ATG CTC ATG GCG TCC ACC ACT TCC 3' 

antisense oligo (allows the incorporation of the EcoRI site): 5' CGA CGG AAT 
TCT GCG ATG AAG TGT CCT TGG CCA G 3'. 
[0073] The PCR reaction is carried out in 50 microliters of a reaction mixture comprising: 



- pRK5-V2 20 ng 
-sense oligo 100 ng 

- antisense oligo 1 00 ng 

- Pfii Turbo polymerase (Stratagene) 2.5 U 

- 10X Pfa buffer (Stratagene) 5 |xl 



- dNTP 80 nM final for each of the 4 
according to the following cycle parameters: 

- initial denaturation at 95 °C for 2 minutes, then 

- 25 cycles: 95°C, 30 seconds then 55°C, 1 minute, 72°C, L5 minutes, then 

- final elongation at 72°C for 10 minutes. 

[0074] The presence of the amplified fragment (amplified PCR V2 fragment) is checked on 
1% agarose gel. 

Step 2: Purification of the amplified fragment (amplified PCR V2 fragment): 
[0075] The zone of the agarose gel in which the amplified DNA fragment is visualized is cut, 
and the cDNA is purified using the purification kit Qiaquick gel extraction kit (Qiagen reference 
28706), adhering strictly to the protocol recommended by the supplier. 
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Step 3: Cutting of the amplified PCR V2 fragment by the enzymes BamHI and EcoRI: 

[0076] This is carried out in a single step using the enzymes sold by New England Biolabs 

(NEB) by incubation for 3 hours at 37°C, in a final volume of 50 microliters containing: 



- V2 insert (amplified fragment) 100 to 200 ng 24 (il 

- 1 OX EcoRI buffer NEB 5 ^il 

- bovine serum albumin NEB 100X (10 mg/ml) 1 ^1 

- EcoRI (40 U) 2 jil 

- BamHI (40 U) 2 jil 
-water 16 jal 



[0077] At the end of the reaction, the two enzymes are inactivated by heating at 80°C for 20 
minutes. 

[0078] The PCR V2 fragment is then purified using a 1% agarose gel according to the 
protocol described above. 

Step 3: subcloning of the amplified PCR V2 fragment in the BamHI and EcoRI sites of the 
vector pET2 1 a of Example 1 : 

[0079] Ligation is carried out by incubating at ambient temperature (20-25°C) for 4 hours in 
a medium comprising: 

- BamHI/EcoRI PCR V2 fragment (100 to 200 ng) 8 \il 



- vector pET2 1 a (30 ng) cut by BamHI/EcoRI 3 |il 

- 10X ligase buffer (NEB) 2.5 \i\ 

- T4 DNA ligase (NEB) 2 jxl 

- water 9.5 
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[0080] The ligation product, the integrin/human vasopressin V2 receptor fusion protein, is 
then used for transformation of Rosetta bacteria (DE3) in order to carry out the receptor 
expression tests. 

[0081] Introduction of the expression vector into a bacterium and expression of the protein of 
interest: 

Transformation: 

[0082] The vector obtained above is then introduced into a bacterium of the Rosetta strain 
(DE3) using the heat shock technique, following the transformation protocol recommended by 
the supplier, in this case Novagen. 

[0083] 20 nl of Rosetta bacteria (DE3), Novagen reference 70954-4, and 1 \il of pET21a- 
integrin/V2 (a few nanograms) are incubated on ice for 30 minutes, then kept at 42°C for 30 
seconds and then again on ice for 2 minutes in order to perform a heat shock. 
[0084] 80 \x\ of SOC medium (see composition in Molecular Cloning, a laboratory manual. 
2nd edition, Sambrook, Fritsch, Maniatis eds. CSH laboratory press, 1989) are then added, then 
the whole is incubated at 37°C for 1 hour with stirring at 300 rpm. 

[0085] The incubation medium is then spread onto Petri dishes containing LB agar + 
ampicillin at 100 micrograms/ml. The dishes are incubated at 37°C for 16 hours. The bacteria 
of a colony are then cultured at 37°C in 10 ml of LB medium containing 100 |!g/ml of ampicillin 
(or of its analog, carbenicillin), and the cell suspension is stirred at 300 rpm. 
Expression of the protein: 

[0086] When the optical density of the culture reaches 0.6 U, expression of the recombinant 
protein is induced by adding 1 mM IPTG. 
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[0087] Samples are taken 2, 3 and 4 hours after induction. To do this, 1 ml of bacterial 
suspension, with an optical density of 0.6, is taken from each culture. The sample is centrifuged 
for 2 minutes at 12000 rpm. The supernatant is removed and the pellet is resuspended in 60 nl of 
lysis buffer (25 mM Tris, pH 8.3, 185 mM glycine, 0.1% SDS). 60 of SDS buffer (10% 
glycerol, 5% 2-mercaptoethanol, 25 mM Tris-HCl, pH 6.5, 8% SDS, bromophenol blue (a few 
grains)) are then added and 10 jil of the lyzed sample (total protein extracts) are then deposited 
on acrylamide/bis-acrylamide 12% - SDS 0.1% gel. Following migration of the proteins, the 
latter are stained with Coomassie blue according to conventional techniques. 
[0088] Fig. 2 shows the results obtained. The induced samples are compared with controls 
which have not been induced (NI) but which have been cultured for an equivalent time. It can be 
seen that the o5-V2 receptor fusion protein, which has an apparent molecular weight of around 
65 kDa, is one of the majority proteins of the bacterium, this being a condition which is 
necessary for purification of the receptor in a quantity compatible with analyses of its structure 
using NMR or crystallography approaches. 

[0089] 1 ml of culture thus formed made it possible to obtain around 3 ^g of vasopressin 
receptor. 

Example 3 : Expression of other receptors: 

[0090] The result obtained in Example 2 was reproduced with the same effectiveness for 
other GPCRs, such as the /?3 -adrenergic receptor, the BLT2, Cys-LTl and Cys-LT2 receptors of 
leukotrienes LTB4, LTD4 and LTC4, the cannabinoid receptor type 1, the vasopressin Via 
receptor and the oxytocin receptor. 
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Example 4: Purification of the a5-integrin fragment- vasopressin V2 receptor fusion protein 
obtained in Example 2: 

[0091] The method used is that described by Porath J. et coll. (Metal chelate affinity 
chromatography, a new approach to protein fractionation. Nature 258, 598-599, 1975). 
[0092] The example described here is a test which made it possible to purify 3 mg of fusion 
protein from a bacterial culture of 100 ml. 

[0093] A colony isolated on LB agar + ampicillin (100 ^ig/ml) is pricked and cultured in 10 
ml of culture medium LB + carbenicillin (100 ng/ml). Culturing is carried out at 37°C, with 
stirring at 300 rpm. When the optical density of the culture reaches 0.6, culturing is stopped and 
the culture is kept in the refrigerator (this sample is called the preculture). The next day, in a 500 
ml Erlenmeyer, 100 ml of culture medium LB + carbenicillin (100 \ig/m\) are seeded with 2 ml 
of preculture and left at 37°C, at 300 rpm, until the optical density of the culture has reached 0.6. 
0.1 mM IPTG is then added to the culture so as to induce expression of the recombinant protein. 
Culturing is continued for around 3 hours, until an optical density of 2.4 is obtained (stimulation 
factor of 4). 

[0094] The culture is then centrifiiged at 4000 rpm for 5 minutes. The supernatant is 
removed and the pellet can be lyzed directly or kept at -80°C. 

[0095] For lysis, the pellet is taken up by homogenization using a pipette in 6 ml of Tris-HCl 
20 mM, pH 8.00 + protease inhibitors (leupeptin 5 ng/ml; benzamidine 10 )ig/ml and PMSF 10 
|ig/ml). These three protease inhibitors will be incorporated in all the buffers used hereafter. 
[0096] The bacteria are lyzed by sonication using a Branson conical microprobe (duty cycle 
50%, output control 5, frequency 1 burst per second for 30 seconds, then rest for 30 seconds; this 
cycle is repeated 5 times). The tube is kept in ice during the sonication. The medium is then 
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centrifuged for 30 minutes at 15000 rpm at 4°C. The supernatant is kept for control on 
electrophoresis gel. 

[0097] The pellet contains the protein of interest since the latter has accumulated in the 
inclusion body. 

[0098] The pellet is taken up by homogenization using a pipette in 5 ml of Tris-HCl 20 mM, 

pH 8.00. The lysis and centrifugation steps are repeated once. 

[0099] The centrifugation supernatants are kept for control on electrophoresis gel. 

[0100] The pellet is taken up by homogenization using a pipette in 5 ml of Tris-HCl, pH 

8.00, 1M urea. A magnetic bar is placed in the sample and the latter is stirred gently for lh30. 

The tube is kept in ice during this step, which corresponds to washing of the inclusion body and 

makes it possible to remove membrane proteins or cytoplasmic proteins which are associated 

with the inclusion bodies but which are considered as contaminants with respect to the 

recombinant protein. 

[0101] The whole is then centrifuged at 15000 rpm for 30 minutes at 4°C. The supernatant is 
kept for control on electrophoresis gel. 

[0102] In order to solubilize the inclusion bodies and thus the protein of interest, the pellet is 
taken up by homogenization using a pipette in 5 ml of Tris-HCl 20 mM, pH 8.00, 6M urea, SDS 
0.2%. 

[0103] A magnetic bar is placed in the sample and the latter is stirred gently for 3 hours, in 
ice. 

[0104] The protein of interest (the a5-integrin fragment-vasopressin V2 receptor fusion 
protein) is then completely denatured. 
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[0105] The whole is then centrifuged at 15000 rpm for 30 minutes at 4°C. The supernatant 
contains the protein of interest and constitutes the sample which will be brought into contact with 
the Ni-NTA (nickel-nitrilotriacetic acid) resin so as to purify the alpha5-V2 fusion by means of 
affinity chromatography. 

[0106] 3 ml of Superflow Ni-NTA agarose resin (Qiagen, ref. 30430) are equilibrated in 
Tris-HCl 20 mM, pH 8.00, 6M urea, SDS 0.2%, NaCl 150 mM, imidazole 5 mM. A sufficient 
amount of NaCl and imidazole are added to the sample containing the protein of interest so as to 
obtain a final concentration of 150 mM of NaCl and 5 mM of imidazole. The sample and the 
resin are brought into contact and are left to incubate at 4°C for 16 hours with gentle stirring. 
The sample/resin mixture is deposited in a plastic column and left. After settling, the "flow- 
through" fraction is recovered at a low flow rate, for control on electrophoresis gel. 
[0107] The resin is then washed with 3 x 9 ml of a solution of Tris-HCl 20 mM, pH 8.00, 6M 
urea, SDS 0.2%, NaCl 150 mM, imidazole 20 mM, to remove all the proteins not specifically 
held on the nickel groups. The wash eluates are kept for control on electrophoresis gel. 
[0108] The protein of interest is then detached from the resin by passing through 3 ml of a 
solution of Tris-HCl 20 mM, pH 8.00, 6M urea, SDS 0.2%, NaCl 150 mM, imidazole 100 mM. 
An aliquot of the purified protein is kept for control on electrophoresis gel. 
[0109] 10 Jul of the medium containing the purified protein are mixed with 10 jxl of SDS 
buffer and the whole is deposited on an electrophoresis gel. 

[0110] 10 |il of the purified sample contain from 5 to 10 \ig of protein, that is to say that 1.5 

to 3 mg of recombinant protein are contained in 3 ml of eluate. 

[0111] Fig. 3 shows the purified protein deposited on electrophoresis gel. 
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[0112] The purified sample is dialyzed against a solution of Tris-HCl 20 mM, pH 8.00, 6M 
urea, NaCl 150 mM so as to remove the SDS and the imidazole. To do this, the sample is placed 
in a Pierce dialysis cassette (membrane of 10000 MWCO) and dialysis is carried out in a beaker 
containing one liter of buffer. The dialysis is carried out at 4°C for at least 24 hours. 
[0113] The sample is recovered and the amount of protein of interest which is obtained is 
metered by measuring the absorption (excitation at 280 nM, absorption between 235 and 500 
nM). In general, an O.D. of 1 to 1.5 is obtained, which is equivalent to a concentration of 0.5 to 
1 mg/ml, which is equivalent to a concentration of around 10 \xM. 

[0114] The protein which has been purified and denatured (since it has been solubilized in 
6M urea) is used for the renaturation tests. 

Example 5: Construction of a vector which allows the simultaneous expression of two 
proteins of interest (in this case two GPCRs) in Escherichia coli and their purification. 

[0115] A complementary DNA coding for a protein of interest, in this case the human 

chemokine receptor CXCR4, is inserted in the vector pET21a(+)-oc5V2 described above in 

Example 2. This DNA must be in phase with that coding for the c6V2 fusion and is positioned 

between the SacI and Hindlll restriction sites for example. The vector directly supplies the 

sequence coding for the 6xHIS tag which will thus be located at the C-terminal end of the 

receptor CXCR4 and will therefore allow its purification in a subsequent step. 

[0116] In the example, an optimized ("bacterialized") version of the CXCR4 is inserted in 

the vector, but the natural eukaryotic version of this receptor (Herzog H, Hort YJ, Shine J and 

Selbie LA. Molecular cloning, characterization and localization of the human homolog to the 

reported bovine NPY Y3 receptor: lack of NPY binding and activation. DNA Cell Biol. 12, 

465-471, 1993) can also be used in the same way. 
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Step 1 : Preparation of the complementary DNA of the human receptor CXCR4. 
[0117] Recognition sites for the restriction enzymes Sad and Hindlll are added on either 
side of the sequence coding for the human receptor CXCR4 during a conventional PCR reaction. 
The complementary DNA of this receptor is amplified from the vector pETlOl/D-TOPO 
(Invitrogen) in which it is subcloned and from two primer oligonucleotides which make it 
possible to insert the restriction sites in question. 

Sense oligo (incorporation of the Sad site): 5' CGAGCTAAGGC GAGCTC A 

ATGGAAGGCATTAGCATTTATAC 3' 

Antisense oligo (incorporation of the Hindlll site): 5 5 CGACGGCCC AAGCTT 

GCTGCTATGAAAGCTGCTGCTTTC 3' 
[0118] The PCR reaction is carried out in 50 microliters and is composed of: 



pETlOl/D-TOPO 20 ng 

sense oligo 100 ng 

antisense oligo 1 00 ng 

Pfu Turbo polymerase (Stratagene) 2.5 U 

lOXPfu buffer 5 \i\ 



dNTP 80 final for each of the 4 
[0119] The reaction parameters are: 

initial denaturation at 95°C for 2 minutes, then 25 cycles: 95°C, 30 s; 55°C, 1 
min; 72°C, 1.5 min, then final elongation at 72°C, 10 min. 
[0120] The presence of the amplified fragment is checked on 1% agarose gel. 
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Step 2: Purification of the amplified fragment 

[0121] The zone of the agarose gel in which the amplified DNA fragment is visualized is cut, 

and the cDNA is purified using the purification kit Qiaquick gel extraction kit (Qiagen reference 

28706), adhering strictly to the protocol recommended by the supplier. 

Step 3: Cutting of the amplified PCR CXCR4 fragment by Sad and Hindlll: 

[0122] This is carried out in two successive reactions using restriction enzymes sold by NEB 



Biolabs during incubation at 37°C. 
1st reaction for 3h: 

PCR fragment 500 ng 

10X NEB buffer 1 5 \il 

bovine serum albumin (BSA) 1 \i\ 

Sad 40 U 

water qs 50 ^1 



[0123] The PCR fragment which has been amplified and cut by SacI is purified using an 
agarose gel according to the protocol described in step 2. 
2nd reaction for 3h: 

PCR fragment recovered from the previous reaction 



10X NEB buffer 2 5 \il 

BSA 1 jil 

Hindlll 40 U 

water qs 50 ^1 



[0124] The PCR fragment which has been amplified and cut by Sacl/Hindlll is purified 
using an agarose gel according to the protocol described in step 2. 
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Step 4: subcloning of the PCR CXCR4 fragment which has been amplified and cut by 
Sacl/Hindlll in the vector pET21a(+)-c6V2: 

[0125] This step is carried out by ligation at ambient temperature for 16h00. 
cut PCR fragment 200 ng 

vector pET21a(+)-o5V2 cut by the same enzymes 50 ng 
10X ligase buffer (NEB) 2.5 p.1 

T4 DNA ligase (NEB) 2 jal 

water qs 25 \il 

[0126] The vector obtained, which codes for a triple fusion protein, integrin-V2 receptor- 

CXCR4 receptor, is then used for a transformation of Rosetta bacteria (DE3) for the purpose of 

expressing this fusion and purifying it. 

Step 5: transformation of the Rosetta bacteria 

[0127] Follow to the letter the protocol described in Example 2. 

Step 6: expression of the a5V2-CXCR4 fusion 

[0128] Follow to the letter the protocol described in Example 2, but the LB culture medium 
is replaced by Hyperbroth medium (Athena Enzyme Systems) and the optimal induction time is 
4 hours. 

Step 7: purification of the ot5V2-CXCR4 fusion. This step is shown in Fig. 4. 
[0129] Follow to the letter the protocol of Example 4, but, during the step of washing the Ni- 
NTA agarose resin, a concentration of 15 mM of imidazole instead of 20 mM is used in the wash 
solution. Elution is carried out to 100 mM as in Example 4. 

[0130] The spacer arm can also be inserted upstream of the thrombin cleavage site just after 
the EcoRI site. 
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